THE ISSUE OF PHOTONS IN DIELECTRICS: 
HAMILTONIAN VIEWPOINT 



S. ANTOCI AND L. MIHICH 

Abstract. The definition of tlie pfioton in tlie vacuum of general rela- 
tivity provided by Kermack et al. and by Synge is extended to nondis- 
persive, nonhomogeneous, isotropic dielectrics in arbitrary motion by 
Hamiltonian methods that rely on Gordon's effective metric. By these 
methods the old dilemma, whether the momentum-energy vector of the 
photon in dielectrics is timelike or spacelike in character, is shown to 
reappear under a novel guise. 



1. Introduction 

Despite a widespread conviction, the concept of photon in vacuo does not 
pertain exclusively to quantum physics. Classical general relativity has its 
say on the subject, as it was beautifully shown long ago by Kermack et 
al. jl] and by Synge [2]. By availing of Hamiltonian methods, and of the 
reduction to the vacuum case of the geometrical optics of dielectrics operated 
by Gordon [3] and by Pham Mau Quan [3] , the problem of the definition of 
the photon in dielectrics is considered anew in the present paper. 

The abstract formulation [5] of Hamilton's theory of rays and waves is 
first recalled (Section 2). The metric is then introduced, and the partic- 
ular Hamiltonians that apply to the case of the vacuum (Section 3), and 
of isotropic, nondispersive dielectrics (Section 4) are considered. Synge's 
derivation of the proportionality between energy and frequency for the pho- 
ton in the vacuum of general relativity is then displayed (Section 5). His 
argument, when applied to the case of dielectrics, leads to a twofold pos- 
sibility: one must choose either a timelike photon (Section 6), for which 
however the proportionality between energy and frequency breaks down in 
nonhomogeneous dielectrics, or a spacelike photon (Section 7) for which the 
proportionality is always ensured. The latter option however requires, at 
variance with what occurs in mechanics, that an entity endowed with a 
spacelike momentum-energy vector be associated to a timelike ray. The ex- 
istence of this twofold possibility is akin to the ancient, unsolved dilemma in 
the electromagnetism of continua, whether the energy tensor of the electro- 
magnetic field in dielectrics should be given either the expression proposed 
by Abraham ^ or the one formulated by Minkowski [7]. 
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2. Hamilton's theory in abstract form 

Like Maxwell's equations, also the equations of Hamilton's theory of rays 
and waves [5] lie very deep in the conceptual structure of physics. In fact, 
as it occurs with Maxwell's equations [8], these equations can be written 
without availing of either an affine or a metric connection: a bare eight- 
dimensional manifold suffices, equipped only with coordinates and yi Q- 
In this manifold a 7-surface S is assumed to exist, whose equation is 

(2.1) H{x\yi) = 0. 

By following Synge's account, we consider a curve T joining the points A 
and B on T,, and the integral 

(2.2) /= [""y^dx'. 



J A 

We propose ourselves to find the extremals of this integral with the varied 
curve always lying on S. To this end a parameter u is introduced, whose 
values at A and at B are fixed for all the curves, and we get rid of the side 
condition (j2.ip by considering the variation of the integral 

(2.3) J= [ {yidx' - XHdu), 

J A 

where X{u) is a Lagrange multiplier. A generic variation of J reads 

(2.4) 5J=[y,5x']'^ 

rB / OH OH 

+ / I 6yidx^ — Sx^dyi — HSXdu — X-—^6x^du — X— — Syidu 
J A V Ox^ oyi 

It is asked that 5J = under arbitrary variations of fe*, 5yi, 5A, provided 
that vanish at A and at B. One finds that the extremals must obey the 
equations 

(2 5) dx^^^dH_ dyi^_^aff 

du Oyi ' du Ox^ ' 

Since the variation does not determine A(m), one can choose the parameter 
u in order to write the equations of the extremals in the Hamiltonian form 

(2.6) dx^^m^ dm^_dH_^ 

du Oyi ' du Ox^ ' 
These equations can be given another interpretation, in terms of a four- 
dimensional manifold whose coordinates are x*. In the new interpretation, 
equation (j2.ip no longer refers to a 7-surface S, but to a set of 3-spaces, 
each one associated to a point x*. We want that the theory be invariant 
under arbitrary transformations of the x-coordinates, hence we must think 
of the yi as a covariant four-vector defined on the manifold. Considered in 
the original eight-dimensional space, an extremal is a curve on S. In the 



-'^Latin indices run henceforth from 1 to 4. 
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four-dimensional manifold whose coordinates are x*, it appears as a curve 
X* = x*(n) with an associated vector field m = yi{u). Let us consider 
from the four-dimensional standpoint the integral (12. 2p along an extremal 
joining the points A{x'^) and B{x^), and call it f{x"',x^). It is nothing but 
Hamilton's characteristic function. If the points A and B are varied and the 
integral for the new extremal is compared to the integral for the previous 
one, since the integral in the second row of (12. 4p vanishes for the extremals, 
one finds 

(2.7) 5/= . 

Provided that 6x'^,5x^ can be chosen arbitrarily, it must be 

/X df df , 

(2.8 ^=yi, = -V i- 

Inserting the first of ()2.8p in (j2.ip produces the Hamilton-Jacobi equation 

df 

(2.9) /?(x\/,0 =0, where = 

If the vector field yi defined by the extremals is such that the circulation 



(2.10) j> = 

for an arbitrary closed curve C lying within some (two, three or four dimen- 
sional) domain D, the extremals form what Synge calls a coherent system 
and are named rays. For a coherent system, the integral 

(2.11) I{A,B) = r f,idx' 



A 

between a fixed point A and a variable point B does not depend on the path 
of integration. The subspace of D that is found when B varies but I{A, B) 
maintains a constant value v is called a wave; by varying the parameter v 
we get a set of waves; changing the starting point A does not change the 
set; it merely changes its parametrisation. 

3. Introducing the metric. Null waves and null rays. 

For associating the quantities of the theory with the quantities of experi- 
ence we need a metric. We assume that Qik is the usual pseudo-Riemannian 
metric of general relativity, that can be locally reduced through a suitable 
transformation to the Minkowski form rjik = diag(l, 1, 1, —1). Then we can 
introduce the timelike world-line of an observer endowed with four-velocity 
u* and define the frequency of the wave with respect to that observer: 

(3.1) V = f,,u\ 

We can also consider whether f^i is spacelike, timelike or null. In the first 
case V can vanish locally, while this is impossible in the second case. One can 
also show t5j that in the first case the phase velocity of the wave is inferior 
to the fundamental velocity introduced by the metric, while the converse is 



4 



S. ANTOCI AND L. MIHICH 



true in the second case. In the third case the wave propagates just with the 
fundamental velocity. Such a null wave obeys the Hamilton-Jacobi equation 

(3.2) H{x\f,) = ^g''lilk = 0. 

Since in this case the first of Hamilton's equations (|2.6p reads 

dx* 
du 



(3.3) ^ = g^'U 



the ray velocity ^ is null: 

(3.4) s,,'^^ = 0. 

The second of Hamilton's equations reads now 

(3.5) '^=-1^:m- 

By eliminating / j in (|3.5p with the use of (j3.3p one finds that 

6 dx* 

(3.6) :rT^ = 05 

du au 



the extremals for the Hamiltonian (j3.2p are null geodesies, with the param- 
eter u of the extremal taking the role of special parameter for the geodesic. 
For a coherent system one has 

(3.7) f,^6x' = 

when Jx* is a displacement along the wave. Due to (j3.3p one obtains 

■ dx'^ 

(3.8) gikSx'-r- = 0. 



du 



Since, again due to (|3.3p 



dx 



(3.9) /^,rr_ = 5*^/j^ = o 

one finds that not only the null ray is orthogonal to the null surface, but it 
lies in it too. The fact that a certain curve lies in a certain surface is however 
an occurrence that can be ascertained without a metric; we shall remind of 
this property in the following, when dealing with photons in dielectrics. 

4. The characteristics of Maxwell's equations for a 
nondispersive, isotropic dielectric 

We define [9] the electric displacement and the magnetic field by the 
antisymmetric, contravariant tensor density H**^, while the electric field and 
the magnetic induction are given by the skew, covariant tensor Fj^. We 
define also the four- vectors: 

(4.1) Fi = Fiku\ H, = Hiku^, 
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where is the four- velocity of the medium. If the latter is isotropic in its 
rest frame, its linear constitutive equation reads [7] 

(4.2) fiH''' = F^^ + (e/i - 1){u'F'' - u^F'). 

Gordon noticed that (j4.2p can be rewritten as 
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(4.3) fiW'=[g''--{ef,-l)uV] 
Therefore he introduced the "effective metric tensor" 

(4.4) a'^ = g'^ - (e^u - l)u'u'' 
and rewrote the constitutive equation (j4.2p as 

(4.5) fiW' = ^a'^a'^'Frs, 

where g = —det{gik). Since after some simple algebra one finds [3] that 

(4.6) a = ^, 

where a = — det((Tjfc), equation ()4.3p can be eventually rewritten as 



(4.7) 



H 



ik 



'-V^a"'a^'Frs. 



Therefore, apart from the factor y^e/yU, the constitutive equation in a nondis- 
persive, isotropic dielectric medium is rendered by the effective metric 
exactly in the way the constitutive equation for vacuum is rendered by the 
pseudo-Riemannian metric gik- If the medium is also homogeneous in its 
rest frame, the factor is constant, and the analogy with the vacuum 

case is complete. Then one can reduce to the vacuum the problem of the 
choice of the Lagrangian for the electromagnetic field in nondispersive, ho- 
mogeneous, isotropic dielectrics. As a consequence, in such dielectrics the 
definition of the energy tensor of the electromagnetic field is uniquely given 
by Gordon's procedure [3], and in this way the general relativistic exten- 
sion of Abraham's form [B] is retrieved. In his seminal paper Gordon, after 
finding the expression of the wave equation for tensorial fields in a pseudo- 
Riemannian space, dealt with the limit of geometrical optics in the footsteps 
of Hadamard [10] and proved that the propagation of light in a dielectric 
that is isotropic and homogeneous in its local rest frame must obey, besides 
an equation that links the amplitude A and the phase /, the equation 

Sk . 



(4.8) 



where cr*'^ is the effective metric (j4.4p . To this equation Gordon applied the 
Hamiltonian method of Section 3 to prove 0, inter alia, that "the rays in 
moving bodies are represented through null geodesies of the manifold with 
the line element da"^ = aikdx^dx^^\ 



^See Ref. [S], page 455. 
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Gordon's proof applies to homogeneous dielectrics. In 1957, by studying 
the characteristics of the Maxwell-Einstein equations [4], Pham Mau Quan 
extended the validity of ()4.8p to a nonhomogeneous medium. 

5. Photons in the vacuum of general relativity 

In two remarkable papers pLi |2j Kermack et al. and Synge have shown 
that the concept of photon in vacuo is by no means an exclusive outcome 
of quantum theory, but is deeply rooted in classical general relativity. We 
have already availed [11] of Synge's proof of the proportionality of energy 
and frequency for an entity endowed with both the behaviour of a null wave 
and of a null particle, exchanged between two atoms in arbitrary states of 
motion in a gravitational field. We have rather reproducing it here, since it 
is necessary for the discussion of photons in dielectrics from the Hamiltonian 
viewpoint. 

Atoms are dealt with as pointlike entities. In Figure 1 are shown the 
world-lines of an emitting and of an absorbing atom, joined by a null line 
AqA. We attribute to the photon the Hamiltonian (j3.2p : viewed as a point 
particle, its equations of motion are ()3.6p and ()3.4p . 




Figure 1. Spacetime diagram for two atoms exchanging 
Synge's photon. 



We endow this point particle with a momentum-energy vector that is 
tangent to the photon's world-line and is parallelly propagated along it: 

dr* SM^ 

where 6, assumed to be positive, happens to be a constant thanks to (j3.6p . 
Since it travels with the fundamental velocity, the photon must have zero 
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proper mass, hence: 

(5.2) MiM' = 0. 

An atom is instead pictured as a timelike point particle endowed with the 
momentum-energy vector 

dx* 

(5.3) mou* = mQ—-; 

as 

niQ is the atom's rest mass or rest energy, while is its four- velocity. The 
change of the momentum-energy vector of the atom due to absorption of a 
photon is supposedly ruled by the conservation law: 

(5.4) m'ou' = mou' + M\ 

where ttiq is the rest mass, and u'^ is the four-velocity of the atom after the 
absorption of the photon. By rewriting the conservation law in covariant 
form and by multiplying the two forms term by term one finds 

(5.5) - m'l = -ml + 2moMiu\ 
i.e. 

(5.6) = 

Since for optical processes |m'o — mo\/mo <^ 1 Synge defines the energy E 
of a photon with momentum-energy vector M* relative to an atom endowed 
with four-velocity as 

(5.7) E = -Miu\ 

This scalar quantity is numerically equal to the fourth component of M* in 
a locally Minkowskian coordinate system where the atom is at rest. 

The special parameter u along the null geodesies joining the world- lines 
of the absorbing and of the emitting atoms can be given a single starting 
value and a single terminal value on these lines, since the geodesic equation 
is invariant under the linear transformation u' = au + b, where a and b 
are constants. Under this transformation the Hamilton equations (13. 3p and 
()3.5p are preserved too, provided that the yi undergo the corresponding 
transformation. The latter does not affect the structure of the waves, but 
only their parametrisation. Let v be the parameter introduced at the end of 
Section 2, associated to the waves after this reparametrisation has occurred. 
We can write 

d f ^ dx'^ \ 5Mi dx^ ^ 6 dx^ 
du \ ^ dv J 5u dv ^ 6u dv 



We have also 



6 dx^ dx'- 5 dx^ 



5u dv dudv ' dv du 6v du 
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where F^^ is the ChristofFel connection built with gik- Since 6Mi/5u = 0, 
the right hand side of (15. Sp can be rewritten as 



(5 10) M- ^ '^^^ — 9 ^ — ^ 9 ^ ( dx''\ _ ^ 

^ Su dv * du Sv du 2 dv \ ^ du du 

Hence one finds 

and, with reference to Figure 1: 
(5.12) =fM.* 



Let Uq, be the four- velocities of the atoms at Aq and at A respectively. 
If dv is the infinitesimal increment of the parameter v when going from the 
null surface joining Aq and j4 to a neighbouring one, we can write: 

(5.13) — — dv = Undso, -TT- dv = u^ds, 
\9vJao \dvJA 

hence from (15.120 we get 

(5.14) (^^^')ao = 

and the definition (|5.7p of the energy of a photon absorbed (or emitted) by 
an atom allows rewriting the last equation as 

(5.15) E^dsQ = Eds. 

From the wave model of the photon implicit in Hamilton's formulation in 
a coherent system one gathers that, if dsQ and ds are the intervals during 
which the emission and the absorption of the wavelike photon takes place 
respectively at the atoms and A, it must be: 

(5.16) i^adsQ = vds, 

where i^q and u are the scalar frequencies of emission and of absorption 
defined by (|3.ip . By dividing term by term this equation and equation (|5.15p 
we get: 

/ N En E 

(5.17) — = -. 

I/O 1/ 

Therefore, Hamilton's theory allows to show that if a photon is emitted by 
an atom and absorbed by another one in presence of a gravitational field, 
the ratio energy /frequency is the same for emission and for absorption. This 
ratio is independent of the behaviour of the gravitational field and of the 
state of motion of the two atoms. 



"^The value of the scalar frequency (|3.ip depends on the parametrisation, but this fact 
affects both sides of equation (|5.16p in the same way. 
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6. Timelike photons in nondispersive, isotropic dielectrics 

Thanks to Gordon's effective metric (|4.4p and to tlie corresponding Hamil- 
tonian (14. Sp it is immediate to extend Synge's proof to nondispersive, iso- 
tropic dielectrics. The diagram in Figure 1 appUes also in this case, and the 
(m, v) parametrisation remains unaltered. The metric 

(6.1) CTik = gik - ( — - A UiUk 

takes in the Hamiltonian the role that gik had in the previous Section, but of 
course also the latter shall intervene: it is the real metric of spacetime, and 
e.g. the line elements dso and ds entering the four-velocities of the atoms 
and of the medium must be measured by gik- We define the null geodesies 
of the effective spacetime as 

(6.2) ??^-?? + Sm^?^ = 0> 
ou au dw^ du du 

where 

, . dx* dx^ 

(6.3) aik-. — = 0, 

du du 

and the Christoffel connection to be used is now 

(6.4) = -O-*™ {(7mk,l + (^niLk " (^kl,m) ■ 

Gordon's reduction to the vacuum imposes to model the photon as a null 
particle of the effective spacetime, and to write 

dx* 

(6.5) M^ = e^,^ = o, 

du du 

where 6, assumed to be positive, happens to be a constant thanks to (j6.2p . 
Since the photon travels with the fundamental velocity of aik we shall pose 

(6.6) aikM'M'' = M^,)M' = 0, 

where we have adopted Gordon's convention of enclosing in round paren- 
theses the indices moved with the effective metric. We can again model 
the interaction of the photon with the atom through the equation uiqu'^ = 
thqu^ + M*. Since 



5jfc + ( 1 - ^ ) UiUk 



(6.7) aikM'M'' 



one finds 



(6.8) gikM'M^ = MiM' = ^i- _ 1^ (uiM* 
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Since e/i > 1, when measured with the real spacetime metric, the momentmn- 
energy of the photon turns out to be timeUke, as it occurs to an ordinary 
particle, for which a rest coordinate system exists. Therefore one finds 

(6.9) - m'l = -ml + 2moMiu' + " {uiM')'^ ; 

instead of (|5.5p . but for optical processes the last term is negligible when 
compared to the remaining ones, and one can still define the energy of the 
photon as in equation (j5.7p . The argument of the previous Section can now 
be repeated with the due changes, by starting from the quantity 

d / dx^\ _''6M(^dx' ^ "Sdx' 
du \ dv J 5u dv Su dv 

One obtains 



du \ ^ ' dv _ 
hence, again with reference to Figure 1: 

By the same argument as in the previous Section: 

(6.13) (^W^Oao = (^{i)^')A 

but, because 



(6.14) M(,)n 



S'jfc + ( 1 - — ) UiUk 



M'u 



MiU^ 



instead of (I5.14|) one finds 



(6.15) dso=(^\ ds. 

V Jao V e/^ J A 

Since equation (j5.16p remains unaltered, instead of (|5.17p one eventually 
obtains 



7. Spacelike photons in nondispersive, isotropic dielectrics 

The photon that we have found through the reduction to the vacuum has 
the unquestionable virtue of being a timelike entity for which a rest sys- 
tem exists and, as far as a homogeneous medium is considered, it exhibits 
the proportionality of energy and frequency that a good-natured photon is 
supposed to possess. But, as soon as a nonhomogeneous medium is consid- 
ered, the proportionality disappears, and we are left wondering whether we 
did some wrong, or whether there is some other opportunity that we have 
missed. 
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The Hamiltonian point of view tells us that the latter is the case. In 
fact, with ()4.8p there is an ambiguity in the choice of the momentum-energy 
vector, that is not apparent in the vacuum case. Let us consider the first of 
Hamilton's equations (12. 6p . In our case it reads: 



Let 

(7.2) pk = ef,k 

be the covariant version of the momentum-energy vector; it conforms to the 
usual Hamiltonian prescription for material particles. Then 

(7.3) = a^V- 

In the vacuum case, M* and pk are the contravariant and the covariant 
forms of one and the same entity, whose norm is null. When e/x > 1, M* 
and Pi are expressions of different entities: the first one is a timelike vector, 
while the second one is spacelike. Let us assume that not M*, but pi is the 
momentum-energy of the photon, by definition the one that is exchanged 
with an atom in the absorption and emission processes. Instead of (j5.4p we 
shall postulate 

(7.4) rn'oii- = moUi + pi. 

If this equation is rewritten in contravariant form and the two forms are 
multiplied term by term one obtains 

(7.5) - m'l = -ml + 2mQPiU^ + pip\ 

For optical processes one can neglect the last term and define 

(7.6) E = -piu' 

as the energy exchanged by the photon. Due to (j7.3p one writes 



(7.7) Miu' = Uka^'pi = Uk - (e/i - l)u^u^ 

hence, instead of the relation (j6.16p . E fulfils 

, , .'E\ fE 

(7.8) 



pi = e^ipiu\ 



8. Conclusion 



While general relativity in vacuo contains a satisfactory model of the pho- 
ton as a null particle whose world-line lies in a null wave, and for which the 
proportionality between energy and frequency holds, the situation is much 
less encouraging if one attempts defining, by general relativistic methods, a 
photon in nondispersive, isotropic dielectrics. 

The Hamiltonian viewpoint shows that in such dielectrics the photon 
should be a pointlike entity associated to the ray of geometrical optics, that 
in its turn is permanently associated to a given phase wave, but it shows 
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also that one must choose between two evils: either maintaining the propor- 
tionality between energy and frequency at the cost of associating a spacelike 
momentum to an entity that travels along a timelike ray, or insisting on the 
natural association of a timelike momentum to the timelike ray at the cost 
of abandoning, when the dielectric is nonhomogeneous, the proportionality 
of energy and frequency. The old dilemma, born with the electromagnetic 
energy tensors of Minkowski [7] and Abraham [6] and still awaiting for an 
experimental resolution [12j , thanks to the methods of Gordon ^ and Synge 
[2] appears here in a novel form. 
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